Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 3955 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 364.2 KiB |
| Average record size in memory | 94.3 B |
Variable types
| Categorical | 5 |
|---|---|
| Numeric | 9 |
| Boolean | 3 |
rental_year has constant value "2021" | Constant |
rental_date has a high cardinality: 212 distinct values | High cardinality |
rental_month is highly correlated with temp | High correlation |
temp is highly correlated with rental_month | High correlation |
rental_month is highly correlated with temp | High correlation |
temp is highly correlated with rental_month | High correlation |
rental_hour is highly correlated with rental_year and 1 other fields | High correlation |
rental_day is highly correlated with rental_year and 1 other fields | High correlation |
rental_month is highly correlated with rental_year and 1 other fields | High correlation |
rental_year is highly correlated with rental_hour and 3 other fields | High correlation |
dayofweek_n is highly correlated with rain | High correlation |
rain is highly correlated with rental_hour and 4 other fields | High correlation |
working_day is highly correlated with dayofweek and 2 other fields | High correlation |
dayofweek is highly correlated with working_day and 1 other fields | High correlation |
peak is highly correlated with working_day and 1 other fields | High correlation |
rental_year is highly correlated with working_day and 5 other fields | High correlation |
holiday is highly correlated with rental_year | High correlation |
season is highly correlated with rental_year | High correlation |
timesofday is highly correlated with rental_year | High correlation |
rental_hour is highly correlated with peak and 3 other fields | High correlation |
rental_month is highly correlated with season and 1 other fields | High correlation |
dayofweek_n is highly correlated with dayofweek and 1 other fields | High correlation |
dayofweek is highly correlated with dayofweek_n and 1 other fields | High correlation |
working_day is highly correlated with dayofweek_n and 2 other fields | High correlation |
season is highly correlated with rental_month and 1 other fields | High correlation |
peak is highly correlated with rental_hour and 1 other fields | High correlation |
timesofday is highly correlated with rental_hour | High correlation |
temp is highly correlated with rental_month and 1 other fields | High correlation |
rhum is highly correlated with rental_hour | High correlation |
count is highly correlated with rental_hour | High correlation |
rental_date is uniformly distributed | Uniform |
rental_hour has 131 (3.3%) zeros | Zeros |
dayofweek_n has 562 (14.2%) zeros | Zeros |
rain has 3582 (90.6%) zeros | Zeros |
Reproduction
| Analysis started | 2022-04-09 10:05:39.893642 |
|---|---|
| Analysis finished | 2022-04-09 10:06:09.261058 |
| Duration | 29.37 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
| Distinct | 212 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.0 KiB |
| 2021-07-24 | 24 |
|---|---|
| 2021-06-27 | 23 |
| 2021-06-20 | 23 |
| 2021-08-01 | 23 |
| 2021-08-24 | 22 |
| Other values (207) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2021-02-01 |
|---|---|
| 2nd row | 2021-02-01 |
| 3rd row | 2021-02-01 |
| 4th row | 2021-02-01 |
| 5th row | 2021-02-01 |
Common Values
| Value | Count | Frequency (%) |
| 2021-07-24 | 24 | 0.6% |
| 2021-06-27 | 23 | 0.6% |
| 2021-06-20 | 23 | 0.6% |
| 2021-08-01 | 23 | 0.6% |
| 2021-08-24 | 22 | 0.6% |
| 2021-08-12 | 22 | 0.6% |
| 2021-06-05 | 22 | 0.6% |
| 2021-06-06 | 22 | 0.6% |
| 2021-06-11 | 22 | 0.6% |
| 2021-08-29 | 22 | 0.6% |
| Other values (202) | 3730 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| 2021-07-24 | 24 | 0.6% |
| 2021-06-20 | 23 | 0.6% |
| 2021-08-01 | 23 | 0.6% |
| 2021-06-27 | 23 | 0.6% |
| 2021-06-11 | 22 | 0.6% |
| 2021-06-19 | 22 | 0.6% |
| 2021-08-29 | 22 | 0.6% |
| 2021-06-12 | 22 | 0.6% |
| 2021-06-06 | 22 | 0.6% |
| 2021-06-05 | 22 | 0.6% |
| Other values (202) | 3730 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.19342604 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 131 |
| Zeros (%) | 3.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 9 |
| median | 14 |
| Q3 | 18 |
| 95-th percentile | 22 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 6.156018827 |
|---|---|
| Coefficient of variation (CV) | 0.4665974408 |
| Kurtosis | -0.7010877986 |
| Mean | 13.19342604 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.3353604129 |
| Sum | 52180 |
| Variance | 37.8965678 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=24)
| Value | Count | Frequency (%) |
| 17 | 212 | 5.4% |
| 13 | 212 | 5.4% |
| 18 | 211 | 5.3% |
| 19 | 210 | 5.3% |
| 12 | 210 | 5.3% |
| 16 | 209 | 5.3% |
| 14 | 208 | 5.3% |
| 10 | 208 | 5.3% |
| 11 | 207 | 5.2% |
| 15 | 207 | 5.2% |
| Other values (14) | 1861 |
| Value | Count | Frequency (%) |
| 0 | 131 | |
| 1 | 88 | |
| 2 | 81 | 2.0% |
| 3 | 48 | 1.2% |
| 4 | 38 | 1.0% |
| 5 | 39 | 1.0% |
| 6 | 124 | |
| 7 | 170 | |
| 8 | 206 | |
| 9 | 206 |
| Value | Count | Frequency (%) |
| 23 | 153 | |
| 22 | 183 | |
| 21 | 192 | |
| 20 | 202 | |
| 19 | 210 | |
| 18 | 211 | |
| 17 | 212 | |
| 16 | 209 | |
| 15 | 207 | |
| 14 | 208 |
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.71908976 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.719058433 |
|---|---|
| Coefficient of variation (CV) | 0.5546796008 |
| Kurtosis | -1.192594954 |
| Mean | 15.71908976 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.01522254709 |
| Sum | 62169 |
| Variance | 76.02197995 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=31)
| Value | Count | Frequency (%) |
| 24 | 142 | 3.6% |
| 27 | 140 | 3.5% |
| 25 | 139 | 3.5% |
| 20 | 138 | 3.5% |
| 19 | 136 | 3.4% |
| 18 | 136 | 3.4% |
| 5 | 134 | 3.4% |
| 26 | 133 | 3.4% |
| 13 | 133 | 3.4% |
| 7 | 133 | 3.4% |
| Other values (21) | 2591 |
| Value | Count | Frequency (%) |
| 1 | 126 | |
| 2 | 129 | |
| 3 | 130 | |
| 4 | 128 | |
| 5 | 134 | |
| 6 | 128 | |
| 7 | 133 | |
| 8 | 132 | |
| 9 | 128 | |
| 10 | 122 |
| Value | Count | Frequency (%) |
| 31 | 60 | |
| 30 | 108 | |
| 29 | 116 | |
| 28 | 123 | |
| 27 | 140 | |
| 26 | 133 | |
| 25 | 139 | |
| 24 | 142 | |
| 23 | 129 | |
| 22 | 131 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.142604298 |
| Minimum | 2 |
|---|---|
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 5 |
| Q3 | 7 |
| 95-th percentile | 8 |
| Maximum | 8 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.98410293 |
|---|---|
| Coefficient of variation (CV) | 0.3858167602 |
| Kurtosis | -1.234041566 |
| Mean | 5.142604298 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.08256495849 |
| Sum | 20339 |
| Variance | 3.936664435 |
| Monotonicity | Increasing |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) |
| 8 | 618 | |
| 7 | 600 | |
| 6 | 599 | |
| 5 | 566 | |
| 3 | 555 | |
| 4 | 536 | |
| 2 | 481 |
| Value | Count | Frequency (%) |
| 2 | 481 | |
| 3 | 555 | |
| 4 | 536 | |
| 5 | 566 | |
| 6 | 599 | |
| 7 | 600 | |
| 8 | 618 |
| Value | Count | Frequency (%) |
| 8 | 618 | |
| 7 | 600 | |
| 6 | 599 | |
| 5 | 566 | |
| 4 | 536 | |
| 3 | 555 | |
| 2 | 481 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.0 KiB |
| 2021 |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2021 |
|---|---|
| 2nd row | 2021 |
| 3rd row | 2021 |
| 4th row | 2021 |
| 5th row | 2021 |
Common Values
| Value | Count | Frequency (%) |
| 2021 | 3955 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| 2021 | 3955 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 KiB |
| False | |
|---|---|
| True | 170 |
| Value | Count | Frequency (%) |
| False | 3785 | |
| True | 170 | 4.3% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.015423515 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 562 |
| Zeros (%) | 14.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.003667551 |
|---|---|
| Coefficient of variation (CV) | 0.6644730138 |
| Kurtosis | -1.25948173 |
| Mean | 3.015423515 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.01552412483 |
| Sum | 11926 |
| Variance | 4.014683653 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) |
| 5 | 582 | |
| 4 | 572 | |
| 1 | 568 | |
| 6 | 566 | |
| 0 | 562 | |
| 3 | 554 | |
| 2 | 551 |
| Value | Count | Frequency (%) |
| 0 | 562 | |
| 1 | 568 | |
| 2 | 551 | |
| 3 | 554 | |
| 4 | 572 | |
| 5 | 582 | |
| 6 | 566 |
| Value | Count | Frequency (%) |
| 6 | 566 | |
| 5 | 582 | |
| 4 | 572 | |
| 3 | 554 | |
| 2 | 551 | |
| 1 | 568 | |
| 0 | 562 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.3 KiB |
| Saturday | |
|---|---|
| Friday | |
| Tuesday | |
| Sunday | |
| Monday | |
| Other values (2) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.136030341 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Monday |
|---|---|
| 2nd row | Monday |
| 3rd row | Monday |
| 4th row | Monday |
| 5th row | Monday |
Common Values
| Value | Count | Frequency (%) |
| Saturday | 582 | |
| Friday | 572 | |
| Tuesday | 568 | |
| Sunday | 566 | |
| Monday | 562 | |
| Thursday | 554 | |
| Wednesday | 551 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| saturday | 582 | |
| friday | 572 | |
| tuesday | 568 | |
| sunday | 566 | |
| monday | 562 | |
| thursday | 554 | |
| wednesday | 551 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 2675 | |
| False | 1280 |
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 KiB |
| Spring | |
|---|---|
| Summer | |
| Winter |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Winter |
|---|---|
| 2nd row | Winter |
| 3rd row | Winter |
| 4th row | Winter |
| 5th row | Winter |
Common Values
| Value | Count | Frequency (%) |
| Spring | 1704 | |
| Summer | 1414 | |
| Winter | 837 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| spring | 1704 | |
| summer | 1414 | |
| winter | 837 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 2587 | |
| True | 1368 |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 KiB |
| Afternoon | |
|---|---|
| Evening | |
| Morning | |
| Night |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.281163085 |
| Min length | 5 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Night |
|---|---|
| 2nd row | Morning |
| 3rd row | Morning |
| 4th row | Morning |
| 5th row | Morning |
Common Values
| Value | Count | Frequency (%) |
| Afternoon | 1258 | |
| Evening | 998 | |
| Morning | 997 | |
| Night | 702 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| afternoon | 1258 | |
| evening | 998 | |
| morning | 997 | |
| night | 702 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 32 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.05638432364 |
| Minimum | 0 |
|---|---|
| Maximum | 10.3 |
| Zeros | 3582 |
| Zeros (%) | 90.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0.3 |
| Maximum | 10.3 |
| Range | 10.3 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3284465882 |
|---|---|
| Coefficient of variation (CV) | 5.825140163 |
| Kurtosis | 303.7377589 |
| Mean | 0.05638432364 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 13.76925744 |
| Sum | 223 |
| Variance | 0.1078771613 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=32)
| Value | Count | Frequency (%) |
| 0 | 3582 | |
| 0.1 | 121 | 3.1% |
| 0.2 | 52 | 1.3% |
| 0.3 | 32 | 0.8% |
| 0.4 | 29 | 0.7% |
| 0.6 | 22 | 0.6% |
| 0.5 | 21 | 0.5% |
| 0.9 | 14 | 0.4% |
| 0.7 | 11 | 0.3% |
| 1.1 | 11 | 0.3% |
| Other values (22) | 60 | 1.5% |
| Value | Count | Frequency (%) |
| 0 | 3582 | |
| 0.1 | 121 | 3.1% |
| 0.2 | 52 | 1.3% |
| 0.3 | 32 | 0.8% |
| 0.4 | 29 | 0.7% |
| 0.5 | 21 | 0.5% |
| 0.6 | 22 | 0.6% |
| 0.7 | 11 | 0.3% |
| 0.8 | 8 | 0.2% |
| 0.9 | 14 | 0.4% |
| Value | Count | Frequency (%) |
| 10.3 | 1 | < 0.1% |
| 5.5 | 1 | < 0.1% |
| 5.1 | 1 | < 0.1% |
| 4.5 | 1 | < 0.1% |
| 3.6 | 1 | < 0.1% |
| 3.5 | 1 | < 0.1% |
| 3.3 | 1 | < 0.1% |
| 2.8 | 4 | |
| 2.7 | 2 | |
| 2.6 | 2 |
| Distinct | 277 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.53605563 |
| Minimum | -4 |
|---|---|
| Maximum | 26.3 |
| Zeros | 3 |
| Zeros (%) | 0.1% |
| Negative | 35 |
| Negative (%) | 0.9% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | -4 |
|---|---|
| 5-th percentile | 2.7 |
| Q1 | 7.9 |
| median | 11.6 |
| Q3 | 15.3 |
| 95-th percentile | 19.5 |
| Maximum | 26.3 |
| Range | 30.3 |
| Interquartile range (IQR) | 7.4 |
Descriptive statistics
| Standard deviation | 5.190960843 |
|---|---|
| Coefficient of variation (CV) | 0.4499770989 |
| Kurtosis | -0.329447302 |
| Mean | 11.53605563 |
| Median Absolute Deviation (MAD) | 3.7 |
| Skewness | -0.05600268498 |
| Sum | 45625.1 |
| Variance | 26.94607447 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 10.1 | 46 | 1.2% |
| 8.9 | 42 | 1.1% |
| 13.6 | 38 | 1.0% |
| 10.6 | 37 | 0.9% |
| 14.9 | 37 | 0.9% |
| 7.6 | 36 | 0.9% |
| 13.2 | 36 | 0.9% |
| 14.3 | 35 | 0.9% |
| 10.7 | 35 | 0.9% |
| 16.3 | 35 | 0.9% |
| Other values (267) | 3578 |
| Value | Count | Frequency (%) |
| -4 | 1 | < 0.1% |
| -3.4 | 1 | < 0.1% |
| -3.3 | 1 | < 0.1% |
| -2.9 | 2 | |
| -2.5 | 1 | < 0.1% |
| -2.1 | 1 | < 0.1% |
| -2 | 1 | < 0.1% |
| -1.9 | 4 | |
| -1.7 | 2 | |
| -1.6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 26.3 | 3 | |
| 26.2 | 1 | < 0.1% |
| 25.9 | 1 | < 0.1% |
| 25.7 | 2 | |
| 25.6 | 1 | < 0.1% |
| 25.4 | 3 | |
| 25.3 | 2 | |
| 25.2 | 1 | < 0.1% |
| 25.1 | 2 | |
| 25 | 1 | < 0.1% |
| Distinct | 69 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 77.08141593 |
| Minimum | 24 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 24 |
|---|---|
| 5-th percentile | 54 |
| Q1 | 68 |
| median | 78 |
| Q3 | 88 |
| 95-th percentile | 96 |
| Maximum | 100 |
| Range | 76 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 13.25417624 |
|---|---|
| Coefficient of variation (CV) | 0.1719503472 |
| Kurtosis | -0.479698594 |
| Mean | 77.08141593 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | -0.4019158076 |
| Sum | 304857 |
| Variance | 175.6731877 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 79 | 120 | 3.0% |
| 88 | 117 | 3.0% |
| 86 | 115 | 2.9% |
| 87 | 115 | 2.9% |
| 93 | 111 | 2.8% |
| 71 | 109 | 2.8% |
| 74 | 106 | 2.7% |
| 73 | 106 | 2.7% |
| 82 | 104 | 2.6% |
| 84 | 102 | 2.6% |
| Other values (59) | 2850 |
| Value | Count | Frequency (%) |
| 24 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 36 | 1 | < 0.1% |
| 37 | 1 | < 0.1% |
| 38 | 1 | < 0.1% |
| 39 | 2 | |
| 40 | 4 | |
| 41 | 3 |
| Value | Count | Frequency (%) |
| 100 | 38 | 1.0% |
| 99 | 28 | 0.7% |
| 98 | 40 | 1.0% |
| 97 | 68 | |
| 96 | 71 | |
| 95 | 89 | |
| 94 | 92 | |
| 93 | 111 | |
| 92 | 97 | |
| 91 | 81 |
wdsp
Real number (ℝ≥0)
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.68369153 |
| Minimum | 1 |
|---|---|
| Maximum | 26 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 6 |
| median | 8 |
| Q3 | 11 |
| 95-th percentile | 17 |
| Maximum | 26 |
| Range | 25 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 4.303581518 |
|---|---|
| Coefficient of variation (CV) | 0.4955935507 |
| Kurtosis | 0.3861786968 |
| Mean | 8.68369153 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.7864481832 |
| Sum | 34344 |
| Variance | 18.52081389 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=26)
| Value | Count | Frequency (%) |
| 7 | 436 | |
| 6 | 403 | |
| 8 | 397 | |
| 5 | 325 | 8.2% |
| 9 | 315 | 8.0% |
| 4 | 291 | 7.4% |
| 10 | 285 | 7.2% |
| 11 | 241 | 6.1% |
| 3 | 224 | 5.7% |
| 12 | 177 | 4.5% |
| Other values (16) | 861 |
| Value | Count | Frequency (%) |
| 1 | 20 | 0.5% |
| 2 | 105 | 2.7% |
| 3 | 224 | |
| 4 | 291 | |
| 5 | 325 | |
| 6 | 403 | |
| 7 | 436 | |
| 8 | 397 | |
| 9 | 315 | |
| 10 | 285 |
| Value | Count | Frequency (%) |
| 26 | 1 | < 0.1% |
| 25 | 4 | 0.1% |
| 24 | 2 | 0.1% |
| 23 | 4 | 0.1% |
| 22 | 15 | 0.4% |
| 21 | 24 | 0.6% |
| 20 | 29 | |
| 19 | 35 | |
| 18 | 49 | |
| 17 | 71 |
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.360556258 |
| Minimum | 1 |
|---|---|
| Maximum | 26 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 5 |
| Q3 | 8 |
| 95-th percentile | 13 |
| Maximum | 26 |
| Range | 25 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.854650765 |
|---|---|
| Coefficient of variation (CV) | 0.7190766367 |
| Kurtosis | 1.000780293 |
| Mean | 5.360556258 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.031947252 |
| Sum | 21201 |
| Variance | 14.85833252 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=24)
| Value | Count | Frequency (%) |
| 1 | 649 | |
| 2 | 489 | |
| 3 | 429 | |
| 4 | 385 | |
| 5 | 355 | |
| 6 | 328 | |
| 7 | 304 | |
| 8 | 243 | 6.1% |
| 9 | 190 | 4.8% |
| 10 | 155 | 3.9% |
| Other values (14) | 428 |
| Value | Count | Frequency (%) |
| 1 | 649 | |
| 2 | 489 | |
| 3 | 429 | |
| 4 | 385 | |
| 5 | 355 | |
| 6 | 328 | |
| 7 | 304 | |
| 8 | 243 | 6.1% |
| 9 | 190 | 4.8% |
| 10 | 155 | 3.9% |
| Value | Count | Frequency (%) |
| 26 | 1 | < 0.1% |
| 24 | 2 | 0.1% |
| 23 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| 20 | 6 | 0.2% |
| 19 | 10 | 0.3% |
| 18 | 8 | 0.2% |
| 17 | 17 | |
| 16 | 19 | |
| 15 | 32 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| rental_date | rental_hour | rental_day | rental_month | rental_year | holiday | dayofweek_n | dayofweek | working_day | season | peak | timesofday | rain | temp | rhum | wdsp | count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2021-02-01 | 6 | 1 | 2 | 2021 | False | 0 | Monday | True | Winter | True | Night | 0.0 | 3.4 | 98.0 | 3 | 1 |
| 1 | 2021-02-01 | 8 | 1 | 2 | 2021 | False | 0 | Monday | True | Winter | True | Morning | 0.0 | 3.5 | 93.0 | 4 | 2 |
| 2 | 2021-02-01 | 9 | 1 | 2 | 2021 | False | 0 | Monday | True | Winter | True | Morning | 0.0 | 2.6 | 93.0 | 2 | 4 |
| 3 | 2021-02-01 | 10 | 1 | 2 | 2021 | False | 0 | Monday | True | Winter | True | Morning | 0.0 | 4.1 | 97.0 | 4 | 3 |
| 4 | 2021-02-01 | 11 | 1 | 2 | 2021 | False | 0 | Monday | True | Winter | False | Morning | 0.0 | 5.2 | 86.0 | 6 | 12 |
| 5 | 2021-02-01 | 12 | 1 | 2 | 2021 | False | 0 | Monday | True | Winter | False | Afternoon | 0.0 | 5.2 | 89.0 | 7 | 9 |
| 6 | 2021-02-01 | 13 | 1 | 2 | 2021 | False | 0 | Monday | True | Winter | False | Afternoon | 0.0 | 6.0 | 85.0 | 7 | 9 |
| 7 | 2021-02-01 | 14 | 1 | 2 | 2021 | False | 0 | Monday | True | Winter | False | Afternoon | 0.0 | 5.9 | 88.0 | 9 | 12 |
| 8 | 2021-02-01 | 15 | 1 | 2 | 2021 | False | 0 | Monday | True | Winter | True | Afternoon | 0.0 | 5.7 | 88.0 | 10 | 6 |
| 9 | 2021-02-01 | 16 | 1 | 2 | 2021 | False | 0 | Monday | True | Winter | True | Afternoon | 0.0 | 5.3 | 89.0 | 10 | 5 |
Last rows
| rental_date | rental_hour | rental_day | rental_month | rental_year | holiday | dayofweek_n | dayofweek | working_day | season | peak | timesofday | rain | temp | rhum | wdsp | count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3945 | 2021-08-31 | 11 | 31 | 8 | 2021 | False | 1 | Tuesday | True | Summer | False | Morning | 0.0 | 14.5 | 72.0 | 13 | 5 |
| 3946 | 2021-08-31 | 12 | 31 | 8 | 2021 | False | 1 | Tuesday | True | Summer | False | Afternoon | 0.0 | 13.6 | 74.0 | 10 | 2 |
| 3947 | 2021-08-31 | 13 | 31 | 8 | 2021 | False | 1 | Tuesday | True | Summer | False | Afternoon | 0.0 | 13.8 | 70.0 | 10 | 1 |
| 3948 | 2021-08-31 | 15 | 31 | 8 | 2021 | False | 1 | Tuesday | True | Summer | True | Afternoon | 0.0 | 14.2 | 73.0 | 11 | 4 |
| 3949 | 2021-08-31 | 16 | 31 | 8 | 2021 | False | 1 | Tuesday | True | Summer | True | Afternoon | 0.0 | 13.6 | 74.0 | 12 | 4 |
| 3950 | 2021-08-31 | 17 | 31 | 8 | 2021 | False | 1 | Tuesday | True | Summer | True | Afternoon | 0.0 | 13.7 | 73.0 | 11 | 6 |
| 3951 | 2021-08-31 | 18 | 31 | 8 | 2021 | False | 1 | Tuesday | True | Summer | True | Evening | 0.0 | 13.4 | 72.0 | 10 | 6 |
| 3952 | 2021-08-31 | 19 | 31 | 8 | 2021 | False | 1 | Tuesday | True | Summer | True | Evening | 0.0 | 13.1 | 69.0 | 10 | 3 |
| 3953 | 2021-08-31 | 20 | 31 | 8 | 2021 | False | 1 | Tuesday | True | Summer | False | Evening | 0.0 | 13.0 | 78.0 | 7 | 2 |
| 3954 | 2021-08-31 | 23 | 31 | 8 | 2021 | False | 1 | Tuesday | True | Summer | False | Night | 0.0 | 13.5 | 87.0 | 7 | 2 |